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Abstract 

We consider the general question of estimating decay of correlations 
for non-uniformly expanding maps, for classes of observables which are 
much larger than the usual class of Holder continuous functions. Our 
results give new estimates for many non-uniformly expanding systems, 
including Manneville-Pomeau maps, many one-dimensional systems with 
critical points, and Viana maps. In many situations, we also obtain a 
Central Limit Theorem for a much larger class of observables than usual. 

Our main tool is an extension of the coupling method introduced by 
L.-S. Young for estimating rates of mixing on certain non-uniformly ex- 
panding tower maps. 

1 Introduction 

In this paper, we are interested in mixing properties (in particular, decay of 
correlations) of non-uniformly expanding maps. Much progress has been made 
in recent years, with upper estimates being obtained for many examples of 
such systems. Almost invariably, these estimates are for observables which are 
Holder continuous. Our aim here is to extend the study to much larger classes 
of observables. 

Let / : {X, v) O be some mixing system. We define a correlation function 



{(f o f"')ipdi' — / Lpdv / ipdiy 



for ip,ip e C^. The rate at which this sequence decays to zero is a measure 
of how quickly ip o f" becomes independent from ip. It is well known that 
for any non-trivial mixing system, there exist ip,ip C'^ for which correlations 
decay arbitrarily slowly. For this reason, we must restrict at least one of the 
observables to some smaller class of functions, in order to get an upper bound 
for C„. 

Here, we present a result which is general in the context of towers, as in- 
troduced by L.-S. Young ( |Yo| ) . There are many examples of systems which 
admit such towers, and we shall see that under a fairly weak assumption on 
the relationship between the tower and the system (which is satisfied in all the 
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examples wc mention) we get estimates for certain classes of observables with 
respect to the system itself. One of the main strengths of this method is that 
these classes of observables may be defined purely in terms of their regularity 
with respect to the manifold; this contrasts with some results, where regularity 
is considered with respect to some Markov partition. 

Acknowledgements. This paper is based on work contained in my Ph.D. 
thesis, written at the University of Warwick. Many thanks to my supervisor, 
Stefano Luzzatto, and also to Omri Sarig and Peter Walters. I am grateful 
for many conversations with Mark Holland, Frederic Paccaut, Gavin Band and 
Mike Todd. 

2 Statement of results 

Let us start by defining the classes of observables we consider. Let (X, d) be a 
metric space. For a given function ^ : X — > R, for each e > we write 



We see that TZ^ {tp) ^ as £ ^ if and only if ip is uniformly continuous. We 
define classes of functions corresponding to different rates of decay for TZe{tp), 
as follows: 

Class (m,7), 7e (0,1]; e (Rl,j) if W = 

Class {R2,j),-fe (0,1); iR^,l) if 7e,(?A) = 0(cxp{-| logej''}). 

Class (i?3,7), 7 > 1; ^ G (i?3,7) if 7^,(^) = O(cxp{-(log | loge|)T}). 

Class (i?4,7), 7 > 1; V e (i?4,7) if 7^e(V') = 0(1 log ^^l"^)- 

We write (Rn) = U-y(i?n, 7) for each n. Note (Rl) is the class of functions 
which are Holder continuous. Also note that (Rl) C (-R2) C C (i?4) with 
the inclusions strict, and in fact {Rn, 7) D {R{n + 1)) for every n = 1, 2, 3, with 
7 in the appropriate interval. It is not difficult to find observables with exactly 
any of these regularities; for instance, the function ^ : [0, 5] ^ M given by 



for some 7 > 1, has the property that TZe{tjj) = tjj{e) for all small e. 

Before we state the exact technical result, we shall illustrate its implications 
by stating results for a number of different classes of system. In each case, 
the estimate given for Holder continuous observables (i.e. case (Rl)) has been 
obtained previously, but we are able to give estimates for observables in each 
of the other classes, most of which are new. Often the same estimates given 
in the Holder case also apply to one of the larger classes; it seems that when 
the rate of mixing for Holder observables is sub-exponential, we can expect the 
same rate of mixing to hold for a larger class of observables. For instance, when 
Holder observables give polynomial mixing, we tend to get the same speed of 
mixing for observables in some class (R4,7), 7 > 1. 



7^e(■0) := sup{|-0(a;) - V-Cy)! : d{x,y) < e}. 




x>0 
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All of our results shall take the following form. Given a system / : X O, a 
mixing acip v, and (f G £°°{X, v), tp £ 1, for some class 1 = (Ri,^) as above, 
we obtain in each example an estimate of the form 

C„{ip,ip;u) < \\ip\\ooC{i>)Un, 

where || • \\oo is the usual norm on £°°{X,i/), C{ip) is a constant depending 
on / and tp, and (u„) is some sequence decaying to zero with rate determined 
by / and TZ^{ip). Notice that wc make no assumption on the regularity of the 
observable (p; when discussing the regularity class of observablcs, wc shall always 
be referring to the choice of the function ip. (This is not atypical, although some 
existing results do require that both functions have some minimum regularity.) 

For brevity, we shall simply give an estimate for m„ in the statement of each 
result. For each example we also have a Central Limit Theorem for those ob- 
servablcs which give summable decay of correlations, and are not coboundarics. 
We recall that a real-valued observable ^jj satisfies the Central Limit Theorem 
for / if there exists cr > such that for every interval J C M, 

Note that the range of examples given in the following subsections is meant 
to be illustrative rather than exhaustive, and so we shall miss out some simple 
generalisations for which essentially the same results hold. We shall instead 
try to make clear the conditions needed to apply these results, and direct the 
reader to the papers mentioned below for further examples which satisfy these 
conditions. 

2.1 Uniformly expanding maps 

Let / : M O be a C^-diffcomorphism of a compact Ricmannian manifold. We 
say / is uniformly expanding if there exists A > 1 such that ||£)/a;t'|| > X\\v\\ 
for all a; e M, and all tangent vectors v. Such a map admits an absolutely 
continuous invariant probability measure fi, which is unique and mixing. 

Theorem 1 Let (p € £^{AI,fi), and let : M ^ M. be continuous. Upper 
bounds are given for as follows: 

• ifijjG {Rl), then Un = C(6I") for some 6 G (0, 1); 

• iftjjG (i?2,7), for some 7 e (0, 1), then Un = 0(e~" ) for every 7 < 7; 

• if ^ G (i?3,7), for some 7 > 1, then u„ = ©(e^^'"^")^ ) for every 7' < 7; 

• for any constant Coo > there exists C < 1 such that if ip € (i?4, 7) for 
some 7 > and TZoo{tp) < (^00, then Un = 0{n^~^^). 
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Furthermore, the Central Limit Theorem holds when ijj S (-R4, 7) for suffi- 
ciently large 7, depending on TZooitp)- 

Such maps are generally regarded as being well understood, and in particu- 
lar, results of exponential decay of correlations for observables in (i?l) go back 
to the seventies, and the work of Sinai, Ruelle and Bowen ([SIl, |Boj ) . For a 
more modern perspective, see for instance the books of Baladi (' jBap and Viana 

I have not seen explicit claims of similar results for observables in classes 
(i?2 — 4). However, it is well known that any such map can be coded by a 
one-sided full shift on finitely many symbols, so an analogous result on shift 
spaces would be sufficient, and may well already exist. The estimates here are 
probably not sharp, particularly in the (i?4) case. 

The other examples we consider are not in general reducible to finite alphabet 
shift maps, so we can be more confident that the next set of results are new. 

2.2 Maps with indifferent fixed points 

These are perhaps the simplest examples of strictly non-uniformly expand- 
ing systems. Purely for simplicity, we restrict to the well known case of the 
Manneville-Pomeau map. 

Theorem 2 Let f : [0, 1] O be the map f{x) — x + a;^"*"" (mod 1), for some 
a G (0, 1), and let v he the unique acip for this system. For ip, : [0, 1] R 
with If bounded and ijj continuous, for every constant Ceo > there exists C < 1 
such that if ip & (-^^4, 7) for some 7 > 2^^^, with TZoo{'4') < Coo, then 

* "if 1 = C^^('^+ 1); then Un = 0{n^^'^ logn); 

• otherwise, Un — 0{max{n^~'^ ,n'^^'''')); 

where r = a~^. In particular, when 7 > | the Central Limit Theorem holds. 

In the case where ^ £ (^4,7) for every large 7, this gives u„ — 0{n^^i), 
which is the bound obtained in jYoj for ip £ (-R1). We do not give separate 
estimates for observables in classes (i?2) and (i?3), as we obtain the same upper 
bound in each case. Note that the polynomial upper bound for (Rl) observables 
is known to be sharp f |Hup . and hence the above gives a sharp bound in the 
(i?2) and (i?3) cases, and for (i?4, 7) when 7 is large. 

The above results apply in the more general 1-dimensional case considered 
in [Yoj . where in particular a finite number of expanding branches are allowed, 
and it is assumed that xf"{x) « x" near the indifferent fixed point. 

In our remaining examples, estimates will invariably correspond to either 
the above form, or that of Theorem ^ and we shall simply say which is the 
case, specifying the parameter t as appropriate. 
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2.3 One-dimensional maps with critical points 

Let us consider the systems of |BLS| . These are one-dimensional multimodal 
maps, where there is some long-term growth of derivative along the critical 
orbits. Let / : / ^ / be a C'^ interval or circle map with a finite critical set C 
and no stable or neutral periodic orbit. We assume all critical points have the 
same critical order I g (l,oo); this means that for each c G C, there is some 
neighbourhood in which / can be written in the form 



for some diffeomorphism : M ^ K fixing 0, with the ± allowed to depend on 
the sign of a; — c. 

For c e C, let D„(c) = |(/")'(/(c))|. From fST^ we know there exists an 
acip /i provided 



If / is not renormalisable on the support of /i then is mixing. 

Theorem 3 Let ip £ C°°{I,i.i), and let ip be continuous. 

Case 1: Suppose there exist C > 0, X > 1 such that Dn{c) > CA" for all 
n > 1, c d C . Then we have estimates for (u„) exactly as in the uniformly 
expanding case (Theorem^. 

Case 2: Suppose there exist C > 0, a > 21 — I such that Dn{c) > Cn" for 
all n > I, c e C. Then we have estimates for (u„) as in the indifferent fixed 
point case (Theorem\^ for every r < jrj-. 

In particular, the Central Limit Theorem holds in either case when ip G 
(i?4,7) for sufficiently large 7, depending on TZoo{tp)- 

Again, we have restricted our attention to some particular cases; analogous 
results should be possible for the intermediate cases considered in |BLS| . In 
particular, for the class of Fibonacci maps with quadratic critical points (see 
[ LMp we obtain estimates as in Theorem |21 for every t > 1. 

2.4 Viana maps 

Next we consider the class of Viana maps, introduced in |Vlj . These are ex- 
amples of non-uniformly expanding maps in more than one dimension, with 
sub-exponential decay of correlations for Holder observables. They are notable 
for being possibly the first examples of non-uniformly expanding systems in 
more than one dimension which admit an acip, and also because the attractor, 
and many of its statistical properties, persist in a neighbourhood of systems. 

Let flo be some real number in (1, 2) for which a; = is pre-periodic for the 
system x 1-^ oq — x'^ . We define a skew product / : 5*^ x R by 



f{x)^±\^{x~c)\'+f{c) 




71 



f{s, x) — {ds mod 1, oq + a sin(27rs) — x^). 
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where d is an integer > 16, and a > is a constant. When a is sufficiently small, 
there is a compact interval / C (—2, 2) for which 5*^ x / is mapped strictly inside 
its own interior, and / admits a unique acip, which is mixing for some iterate, 
and has two positive Lyapunov exponents f|Vl | . pX^ '). The same is also true for 
any / in a sufficiently small neighbourhood A/" of /. 

Let us fix some small a, and let A/" be a sufhcicntly small neighbourhood 
of / such that for every f £ N the above properties hold. Choose some / G M] 
if / is not mixing, we consider instead the first mixing power. 

Theorem 4 For ip g C^{S^ x K, i^), i/' G (^4,7), we have estimates for (u„) 
as in the indifferent fixed point case (Theorem\^ for every t > 1. 

The Central Limit Theorem holds for ip € 7) when 7 is sufficiently 
large, depending on TZao{ip). 

Another way of saying the above is that if if) G (-R4, 7), then it„ = 0{n'^~'^'^), 
with the usual dependency of Q on TZooifp)- Note that for observables in n^>i(i?4, 7), 
we get super-polynomial decay of correlations, the same estimate as we obtain 
for Holder observables (though Baladi and Gouezel have recently announced a 
stretched exponential bound for Holder observables - see BG). 

There are a number of generalisations we could consider, such as allowing 
D > 2 f |BST| - note they require / to be C°° close to /), or replacing sin(27rs) 
by an arbitrary Morse function. 

2.5 Non-uniformly expanding maps 

Finally, we discuss probably the most general context in which our methods can 
currently be applied, the setting of |ALP| . In particular, this setting generalises 
that of Viana maps. 

Let / : M ^ M be a transitive local diffeomorphism away from a singu- 
lar/critical set S, with M a compact finite-dimensional Riemannian manifold. 
Let Leb be a normalised Riemannian volume form on M, which we shall refer 
to as Lebesgue measure, and d a Riemannian metric. We assume / is non- 
uniformly expanding, or more precisely, there exists A > such that 

n— 1 

hminf - V log \\Df-,\ \\-' > A > 0. (1) 
For almost every a; in M, we may define 

£(x) = min I ' ^ ll^/7'(.)ll"' ^ V2, Vn > A^j . 

The decay rate of the sequence Leb{£(x) > n} may be considered to give a 
degree of hyperbolicity. Where S is non-empty, we need the following further 
assumptions, firstly on the critical set. We assume C is non- degenerate^ that is, 
m(C) = 0, and 3/3 > such that Va; G Af C we have d(x,CY < \\Df^v\l\v\ < 
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d{x,C)~Nv e T^M, and the functions logdetD/ and \og\\Df-'^\\ are locally 
Lipschitz with Lipschitz constant < d{x,C)~^ . 

Now let ds{x,S) = d{x,S) when this is < S, and 1 otherwise. We assume 
that for any e > there exists S > such that for Lebesgue a.e. x £ M, 



Let / be a map satisfying the above conditions, and for which there exists a > 1 
such that 



Then / admits an acip v with respect to Lebesgue measure, and we may assume 
V to be mixing by taking a suitable power of /. 

Theorem 5 For ip E £°°(M, j/),^ e (i?4,7), we have estimates for (u„) as 
in the indifferent fixed point case (Theorem\^, for t — a. Furthermore, when 
a > 2, the Central Limit Theorem holds for ijj £ (i?4, 7) when 7 is sufficiently 
large for given TZoa(jp)- 

3 Young's tower 

In the previous section, we indicated the variety of systems we may consider. 
We shall now state the main technical result, and with it the conditions a system 
must satisfy in order for our result to be applicable. As verifying that a system 
satisfies such conditions is often considerable work, we refer the reader to those 
papers mentioned in each of the previous subsections for full details. 

The relevant setting for our arguments will be the tower object introduced by 
Young in |Yoj . and we recap its definition. We start with a map F^ : (Ag, mo) O, 
where (Ao,too) is a finite measure space. This shall represent the base of the 
tower. We assume there exists a partition (mod Q) V = : i S N} of Aq, 

such that F^|Ao,i is an injection onto Aq for each Ao,j. We require that the 
partition generates, i.e. that Vj^o(^^) "'^ ^® trivial partition into points. 
We also choose a return time function i? : Aq ^ N, which must be constant on 
each Ao^i. 

We define a tower to be any map F : (A, to) O determined by some F^, V, 
and R as follows. Let A = {{z,l) : z £ Aq, I < R{z)}. For convenience let A; 
refer to the set of points {-,1) in A. This shall be thought of as the Ith level 
of A. (We shall freely confuse the zeroth level {{z,Q) : z £ Aq} C A with Aq 
itself. We shall also happily refer to points in A by a single letter x, say.) We 




n-l 




We define a recurrence time 




heh{{£{x) > n or T{x) > n}) = 0{n-"). 
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write Ai^i = {{z, I) : z G Ao,i} for I < i?(Ao,i). The partition of A into the sets 
Ai^i shall be denoted by rj. 

The map F is then defined as follows: 

{z,l + l) if/ + l<i?(z) 



J^{z,i)-<^ (i^H(^),0) otherwise. 



We notice that the map F^^^^x) on Aq is identical to F^{x), justifying our 

choice of notation. Finally, we define a notion of separation time; for .t, y G Aq, 
s{x,y) is defined to be the least integer n > s.t. {F^)"x, {F^y^y are in 
different elements of V. For x,y G some Ai^i, where x = (xq, I), y = {yo, I), we 
set ,s(.x, y) := s(a;oi Vo)] for x, y in different elements of rj, s{x, y) = 0. 

We say that the Jacobian JF^ of F^ with respect to toq is the real-valued 
function such that for any measurable set E on which JF^ is injective, 



mo{F^{E)) = [ JF^dmo. 



We assume JF^ is uniquely defined, positive, and finite mo-a.e. We require 
some further assumptions. 

• Measure structure: Let B be the cr-algebra of mo-measurable sets. We 
assume that all elements of P and each {F^)~^P belong to B, and 
that F^ and (F^|Ao,i)~^ are measurable functions. We then extend mo 
to a measure m on A as follows: for E C Ai, any ^ > 0, we let m.{E) = 
mo{F~^E), provided that F~'-E G B. Throughout, we shall assume that 
any sets we choose are measurable. Also, whenever we say we are choosing 
an arbitrary point x, we shall assume it is a good point, i.e. that each 
element of its orbit is contained within a single element of the partition rj, 
and that JF^ is well-defined and positive at each of these points. 



Bounded distortion: There exist C > and /3 < 1 s.t. for x,y € any 

Ao,i e r, 



JF^jx) 
JF^y) 



< Cf3- 



• Aperiodicity: We assume that gcd{i?(x) : x G Ao} = f . This is a necessary 
and sufficient condition for mixing (in fact, for exactness). 

• Finiteness: We assume J Rdmo < oo. This tells us that m(A) < oo. 

Let F : (A, m) be a tower, as defined above. We define classes of observ- 
able similar to those we consider on the manifold, but characterised instead in 
terms of the separation time s on A. Given a bounded function ^ : A ^ M, we 
define the variation for n > 0: 

Vni-fp) = sup{\tlj{x) - tp{y)\ : s{x,y) > n}. 
Let us use this to define some regularity classes: 
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Exponential case: -0 G (^l,7)i 7 G (0: 1)) if Vn{ij) = C(7"); 

Stretched exponential case: ip G (V^2,7), 7 e (0, 1), if = 0{e'>q){—n"'}); 

Intermediate case: ip € (^8,7), 7 > 1, if Vn{ip) = ©(expj— (logn)'^}); 

Polynomial case: S (T^4,7), 7 > 1, if Vn(ip) = 0(n~'^). 

We shall see that the classes (Vl-4) of regularity correspond naturally with 
the classes (Rl-4) of regularity on the manifold respectively, under fairly weak 
assumptions on the relation between the system and the tower we construct 
for it. (We shall discuss this further in ^1'6\ ) These classes are essentially those 
defined in ^ , although there the functions are considered to be potentials rather 
than observables. 

We now state the main technical result. 

Theorem 6 Let F : (A,to) O be a tower satisfying the assumptions stated 
above. Then F : (A, to) (3 admits a unique acip v, which is mixing. Further- 
more, for all (p,ip & £°°(A,to), 



where C(^) > is some constant, and (un) is a sequence converging to zero at 
some rate determined by F and w„ ('0) . In particular: 

Case 1: Suppose mo{R > n} = 0{9'^), some 9 e (0, 1). Then 

• if e {VI, 7) for some 7 G (0, 1), then Un = 0{9^) for some 9 e (0, 1); 

• if ip & {V2,j) for some 7 G (0, 1), then Un — 0{e^^'' ) for every 7' < 7; 

• if ip (z (V^3,7) for some 7 > 1, then Un ~ 0(6^^'°^"^^ ) for every 7' < 7; 

• for any constant Coo > 0, there exists C < 1 such that if ^ G (T^4, 7) for 
some 7 > ^, and vo{ip) < Coo, then u„ — 0{n^~'''^). 

Case 2: Suppose TOo{i? > n} ^ 0{n^") for some a > 1. Then for every 
Coo > there exists C, < 1 such that if ^ G (1/4,7) for some 7 > |; with 
vq{'4)) < Coo, then 

• ifj= Un = ©(ni-^logn); 

• otherwise, Un = O (max{n^^" ,n^^'''')'j . 

The existence of a mixing acip is proved in | Yo| . as is the result in the case 
ip S {VI). As a corollary of the above, we get a Central Limit Theorem in the 
cases where the rate of mixing is summable. 

Corollary 1 Suppose F satisfies the above assumptions, and too{-R > n} = 
0{n^"), for some a > 2. Then the Central Limit Theorem is satisfied for 
ip £ (i?4,7) when 7 is sufficiently large, depending on F and wo(V')- 

In H13I we shall give the exact conditions needed on a system in order to 
apply the above results. 
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4 Overview of method 



Our strategy in proving the above theorem is to generahse a coupling method 
introduced by Young in |Yo| . Our argument follows closely the line of approach 
of that paper, and we give an outline of the key ideas here. 

First, we need to reduce the problem to one in a slightly different context. 
Given a system F : (A, m) O, we define a transfer operator F^, which, for any 
measure A on A for which F is measurable, gives a measure i^*A on A defined 

by 

{F^Xm=X{F-'A) 

whenever ^ is a A-measurable set. Clearly any i^-invariant measure is a fixed 
point for this operator. Also, a key property of i^* is that for any function 

6: A^R, 



>oFdX 



(t)diF,X). 



Next, we define a variation norm on m-absolutely continuous signed mea- 
sures, that is, on the difference between any two (positive) measures which are 
absolutely continuous. Given two such measures A, A', we write 



lA-A'l := 



dX dX' 

dm dm 



dm. 



Now let us fix an acip i/ and choose observables ip G C°°{A, v), if) G i3^(A, v) 
with inf ijj > 0, J tpdv = 1. We have 



{ip o F'')iPdiy = I {p o F'')d{iPiy) 
V>diF:ii;u)), 



where tpv denotes the unique measure which has density ip with respect to v. 
So 



{ip o F")il}dv - I ipdv I il)dv 



< 



dF^i^v) dv 



dm dm 

pU\F:{i,v)~v\. 



dm 



Hence we may reduce the problem to one of estimating the rate at which 
certain measures converge to the invariant measure, in terms of the variation 
norm. In fact, it will be useful to consider the more general question of estimat- 
ing \F^X — F"A'| for a pair of measures A, A' whose densities with respect to m 
are of some given regularity. (We shall require an estimate in the case X! ^ v 
when we consider the Central Limit Theorem.) 

Let us now outline the main argument. We work with two copies of the 
system, and the direct product F x F : (A x A, m x m) O. Let Pq = A x A', 
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and consider it to be a measure on A x A. If we let tt, tt' : A x A ^ A be the 
projections onto the first and second coordinates respectively, we have that 

\f:^x-f^x'\ = \tt4fxf)::po~k{fxf)':po\ 

< 2\{FxFy:Po\. 

Our strategy will involve summing the differences between the two projec- 
tions over small regions of the space, only comparing them at convenient times 
that vary with the region of space we are considering. At each of these times, 
we shall subtract some measure from both coordinates so that the difference is 
unaffected, yet the total measure of Pq is reduced, giving an improved upper 
bound on the difference. 

The key difference between our method and that of |Yo| is that we introduce 
a sequence (£«), which shall represent the rate at which we attempt to subtract 
measure from Pq. When the densities of A, A' are of class {VI), (£„) can be 
taken to be a small constant, and the method here reduces to that of jYo| : 
however, by allowing sequences £„ ^ 0, we may also consider measure densities 
of weaker regularity. 

We shall see that it is possible to define an induced map F : A x A — > Ao x Aq 
for which there is a partition of A x A, with every element mapping injectively 
onto Aq X Aq under F. In fact, there is a stopping time Ti : S^i N such that 
for each T G ii, F\T = {F x (r) _ jf choose some T e |i , then {Po\T) is 
a measure on Aq x Aq. The density of Pq with respect to m x m has essentially 
the same regularity as the measures A, A', and the density of F*(Po|r) will be 
similar, except possibly weakened slightly by any irregularity in the map F. 
(We shall see that the map F is not too irregular.) 

Let 

. dh{Po\T) , . 

c(r) = mf — riw). 

weAaxAo a[m x m) 

For any Si G [0, 1], we may write 

F,(Po|r) = eic(r)(m x m|Ao x Aq) + F,(Pi|r) 

for some (positive) measure Pi|r; this is uniquely defined since _F|r is injective. 
Essentially, we are subtracting some amount of mass from the measure F^(Pq\T). 
Moreover, we are subtracting it equally from both coordinates; this means that 
writing T = A x B and k = Ti(r), the distance between the measures F^{X\A) 
and F^{X'\B), both defined on Aq, is unaffected. However, we also see that the 
remaining measure ^^(Pilr) has smaller total mass, and this is an upper bound 
for \f!: iX\ A) ~Fl:{y\B)\. 

We fix an ei, and perform this subtraction of measure for each F G ^i, 
obtaining a measure Pi defined on A x A. The total mass of Pi represents the 
difference between F"A and P"A' at time n = Ti, taking into account that Ti 
is not constant over A x A. Clearly, we obtain the best upper bound by taking 
£1 = 1; however, we shall see that it is to our advantage to choose some smaller 
value for si. 



11 



We choose a sequence and proceed inductively as follows. First, we 

define a sequence of partitions {^i} such that F € is mapped injectively onto 
Ao X Ao under F*. Now given the measure Pj-i, we take an element F e and 
consider the measure F^(Pi_i|F) on Aq x Aq. Here, we let 

c(F) = mf — I A A N 

joeAoxAo d{m X m|Ao x Aq) 

and specify Pj|F by 

F:{P^-l\T) = e,c(F)(m x m|Ao x Ao) + ^^(P.lr). 

As before, we construct a measure Pi, the total mass of which gives an upper 
bound at time Tj. To fully determine the sequence {Pi}, it remains to choose a 
sequence (e^). Our choice relates to the regularity of the densities This 
is relevant because the method requires that the family of measure densities 

(=f^:i>i.r.a 

I d{m X m) I 

has some uniform regularity. (In fact, we require that the log of each of the above 
densities is suitably regular.) We require this in order that at the ith stage of 
the procedure, when we subtract an Si proportion of the minimum local density, 
this corresponds to a similarly large proportion of the average density. Hence, 
provided this regularity is maintained, the total mass of Pi-i is decreased by a 
similar proportion. 

When we subtract a constant from a density as above, this weakcins the 
regularity. However, at the next step of the procedure, we work with elements 
of the partition ^i+i. Since these sets are smaller, we regain some regularity by 
working with measures on Aq x Aq pushed forward from such sets. That is, we 
expect the densities 

jdFi+\p,\r) 



I d{m X m) 
to be more regular than the densities 

i dFl{Pi\r) 
I d{m X m) 



i+l 



■■re^i 



(This relies on the map F being smooth enough that another application of the 
operator F* doesn't much affect the regularity.) 

The degree of regularity we gain in this way depends on the initial regularity 
of and hence of 4^, 4^. with respect to the sequence of partitions. In the 

' am ' am ^ r- 

usual case, where J^, ^ G (Vl), the regularity we gain each time we refine the 
partition is similar to the regularity we lose when we subtract a small constant 
proportion of the density; hence we may take every Si to be a small constant 
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£. Where the initial regularities are not so good, we gain less regularity from 
refining the partition, and so we may only subtract correspondingly less measure. 

For this reason, outside the (1^1) case we shall require that the sequence (e^) 
converges to zero at some minimum rate. However, if (sj) decays faster than 
necessary, we will simply obtain a suboptimal bound. So part of the problem is 
to try to choose a sequence (si) decaying as slowly as is permissible. We shall 
also need to take into account the stopping time Ti (which is unbounded), in 
order to estimate the speed of convergence in terms of the original map F. 

5 Related work 

Let us now mention some other results concerning estimates on decay of corre- 
lations for non-Holder observables. Most of these are stated in the context of 
one-sided finite alphabet shift maps, or subshifts of finite type. (For a compre- 
hensive discussion of shift maps and equilibrium measures, we suggest the book 
of Baladi, |Ba|.l Shift maps are relatively simple dynamical systems, but are 
often used to code more complicated systems via a semi-conjugacy, in much the 
same way that each of the examples we consider can be represented by a suit- 
able tower (see El- Where a system F : X ^ X being coded has an invariant 
measure n which is absolutely continuous with respect to Lebesgue measure, 
H is an equilibrium measure for the potential (f> = — log JF, where JF is the 
Jacobian with respect to Lebesgue measure. Most results for shift maps work 
with an equilibruim measure given by a potential (j) which is Holder continuous 
(in terms of the usual metric on shift spaces - two sequences are said to be dis- 
tance 2~" apart if they agree for exactly the first n symbols). This assumption 
corresponds to assuming good distortion for JF. 

The have been various results concerned primarily with weakening the as- 
sumption on the regularity of (j), and obtaining (slower) upper bounds for the 
rate of mixing with respect to the corresponding equilibrium measures. Kondah, 
Maume and Schmitt ( jKMS| 'l used a method of Birkhoff cones and projective 
metrics, Bressaud, Fernandez and Galves (p3FGj) used a couphng method (dif- 
ferent from the one described here), with estimates given in terms of chains of 
complete connections, and PoUicott ([El) introduced a method involving com- 
posing transfer operators with conditional expectations. Each of these results 
has slightly different assumptions and gives slightly different estimates, but in 
each case a number of difi'erent classes of potentials are considered, and esti- 
mates are given for for observables of some similar regularity to (usually not 
much worse than) the potential. In particular, in all three examples polyno- 
mial mixing is given for a potential and observables with variations decaying at 
suitable polynomial rates. 

In addition, Fisher and Lopes ( |FLj ) and Isola ( I ) have obtained polynomial 
decay of correlations for some specific classes of potentials on the full 2-shift, 
each for a class of observables not unlike our {VA) class. 

We emphasise that each of the above results concerns only shifts (or sub- 
shifts) on a finite alphabet. Furthermore, with the exception of uniformly ex- 
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panding maps f tj'i.lll . none of the examples we consider may be coded (with a 
sufficiently regular semi-conjugacy) by shifts with finite alphabets, due to distor- 
tion considerations. Hence outside the uniform case, there is no direct overlap 
between the above results and our results. These are essentially generalisations 
in a different direction, but are worth mentioning if only to note the variety 
of different methods that have been applied to obtain results for non-Holder 
observables in slightly different contexts. 

We now move on from shift spaces to mention a result on towers. Following 
the method of |KMSj mentioned above, Maume-Deschamps (^]) essentially re- 
produced Young's results ( |Yoj ) . with some slightly weaker estimates. Together 
with Buzzi f |BMlp . the method has been extended to allow the bounded distor- 
tion assumption of ^to be replaced by a condition on the variation of JF, and 
also to allow observables of similar regularity. One important difference is that 
the variation is defined in terms of JF and a partition of the whole tower (the 
partition 77 of ^ rather than JF^ and the base partition, as here. This signifi- 
cantly reduces the classes of observables they can consider; for instance, for any 
tower with unbounded return times, our class {VI) contains many functions that 
cannot be dealt with at all by their method. It is not clear that any estimates 
can be obtained for our examples, except for the uniformly expanding case; for 
Manneville-Pomeau maps, for instance (see ^2.21) . while we can construct the 
same tower in their context, the semi-conjugacy between the tower and the sys- 
tem is not regular enough to give any comparable results. Some applications are 
given in the related paper |BM2| . including certain multi-dimensional piecewise 
expanding afhne maps. 

We also note that a similar result to that of |BMlj was obtained by Holland 
f jHolp . using a coupling method very similar to the one we use here. 

Finally, we mention a result which applies directly to certain non-uniformly 
expanding systems, rather than to a symbolic space or tower. PoUicott and 
Yuri (|PYj) consider a class of maps of arbitrary dimension with a single in- 
different periodic orbit and a given Markov structure, including in particular 
the Manneville-Pomeau interval maps. The class of observables considered is 
dynamically defined; each observable is required to be Lipschitz with respect to 
a Markov partition corresponding to some induced map, chosen to have good 
distortion properties. This class includes all functions which are Lipschitz with 
respect to the manifold, and while some estimates are weaker than compara- 
ble results for Holder observables, bounds are obtained for some observables 
which cannot be dealt with at all by our methods, such as certain unbounded 
functions. 

6 Coupling 

Over the next few sections, we give the proof of the main technical theorem. 

Let F : (A, to) O be a tower, as defined in ^ Let T = {ip : A ^ R \ Vn{ip) 
0}, and let X+ ~ {ip E T : inf (p > 0}. We shall work with probability measures 
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whose densities with respect to m belong to X+. We see 

where depends on inf ip. 

Let A, A' be measures with ^ ^ , and let P = A x A'. For con- 
venience, we shall write u„(A) = v„( J^) for such measures A, and use the two 
notations interchangeably. Wc let C\ = C^p above, where = We shall 
write V for the unique acip for F, which is equivalent to m. 

We consider the direct product F x F : (A x A,rn x m) O, and specify a 
return function to Aq x Aq. We first fix no > to be some integer large enough 
that m(F~"'Ao fl Aq) > some c > for all n > no- Such an integer exists since 
u is mixing and equivalent to m. Now we let R(x) be the first arrival time to 
Aq (setting i?|Ao = 0). We define a sequence {rj} of stopping time functions 
on A X A as follows: 

n{x,y) = no+R{F''"x), 

and so on, alternating between the two coordinates x, y each time. Correspond- 
ingly, we shall define an increasing sequence ^1 < ^2 < • • • of partitions of A x A, 
according to each r^. First, let tt, tt' be the coordinate projections of A x A 
onto A, that is, tt{x, y) := x, i^'{x, y) := y. At each stage we refine the partition 
according to one of the two coordinates, alternating between the two copies of 
A. First, ^1 is given by taking the partition into rectangles E x A, E & rj, and 
refining so that n is constant on each element F G ^1, and F'^^ |7r(r) is for each 
F an injection onto Aq. To be precise, we write 

(Ti(x,y)-1 \ 
V F-^rjj (x) X A, 

using throughout the convention that for a partition ^, ^{x) denotes the element 
of ^ containing x. Subsequently, we say is the refinement of such that 
each element of is partitioned in the first (resp. second) coordinate for i 
odd (resp. even) so that Tj is constant on each element F G ^j, and F'^* maps 
7r(F) (resp. 7r'(F)) injectively onto Ag. 

We define T to be the smallest Tj, i > 2, with (F x Fy^(x, y) G Aq x Aq. 
This is well-defined m-a.e. since v x v \s ergodic (in fact, mixing). Note that 
this is not necessarily the first return time to Aq x Aq for F x F. We now 
consider the simultaneous return function F := {F x F)'^ , and partition A x A 
into regions which F" maps injectively onto Aq x Aq. 

For i > 1 we let Tj be the time corresponding to the ith iterate of F, i.e. 
Ti = T, and for i > 2, 

Ti{z)=Ti-i{z)+T{F'-^z). 



fix) 

f{y) 
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Corresponding to {Tj} we define a sequence of partitions ?7 x 77 < < ^2 < • ■ • 
of A X A similarly to before, such that for each F e Tn\T is constant and F" 
maps r injectively onto Aq x Aq. It will be convenient to define a separation 
time s with respect to <^i; s{w,z) is the smallest n > s.t. F"w, F^^z are 
in different elements of ^1. We notice that if w = {x,x'), z — (jj,y'), then 
s{w,z) < mm{s(x,y),s{x'^,y')). 

Let in = 4^. in' = and let $ = , ''^ We first consider the 

~ dm^ ~ dm ' dmxm ^ ~ 

regularity of F and <i> with respect to the separation time s. 
Sublemma 1 

1. For o/i w, z e A x A with s{w, z) > n, 

JF"-{w) 



log 



JF'^iz) 



for Cp depending only on F. 
2. For all w,z e A X A, 



loe 



^z) 



s(w,z) 



where w„(4') := max (w„((^), ■i;„((/9')), and C$ = C^p + C^i, where C^,C^' 
are the constants given in ^ above, corresponding to ip, if' respectively. 

Proof: Let w — {x,x'), z — (?/,?/')■ When s(u;,z) > n, there exists fc S N 
with F" = (F X F)^ when restricted to the element of ^„ containing w, z. So 



log 



JF'\w) 



JF^{z) 



< 



log 
log 



JF^{x)JF^{x') 



JFHy)JF^{y') 
JF^ix) 

' log 



JF'^iy) 



JF^{x') 



JFHy') 



Let j be the number of times F'^{x) enters Aq, ioi i — 1, . . . , k. We have 



log 



JFf'ix) 



JF^{y) 



< 





E 

i=l 



cp 



siF''x,F''y) + (j-i) 



< (jl ps{F''x,F''y) 

for some C" > 0, and similarly for x' ,y' . So 

JF"(u>) 



log 



JF"(0) 
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for some Cp > 0. For the second part, we have 





< 




+ 












ip'{y') 



< 
< 



C^Vs{x.,y){v) + C^iVs[x',y'){'f') 
C*Ws(tu,z)($)- 



□ 

We now come to the core of the argument. We choose a sequence (e^) < 1, 
which represents the proportion of P we try to subtract at each step of the 
construction. Let Vo = '/'o = We proceed as follows; we push-forward 
$ by F to obtain the function 'il)i{z) := j^^^^ . On each element F G we 

subtract the constant ei mi{ilji{z) : z e F} from the density "01 |r. We continue 
inductively, pushing forward by dividing the density by JF{Fz) to get 'ip2{z), 
subtracting 62 inf{'02(z) : z £ F} from ?/'2|F for each F e ^2, and so on. That is, 
we define: 



^i-i{z) 
JF{F^-^zy 
£i inf ^i{w); 

ipi{z) - Si., 



We show that under certain conditions on (e^), the sequence {V'i} satisfies a 
uniform bound on the ratios of its values for nearby points. 

A similar proposition to the following was obtained simultaneously but in- 
dependently by Holland { |Holj 1 : there, the emphasis was on the regularity of 
the Jacobian. 

Proposition 1 Suppose (e^) < ^ is a sequence with the property that 

i 

v,{<^)Y[{l + e'^)<Ko, (4) 




(5) 



are both satisfied for some sufficiently large constant Kq allowed to depend only 
on F and uo($). Then there exist S < 1 and C > each depending only on F, 
C$ and vq{^) such that if we choose Ei — Ss'^ for each i, then for all w, z with 
s{w, z) > i > I, 



log-f 



< c. 
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Proof: Suppose we are given such a sequence {e[) and assume that for each 
i we have 



log^ 



log 



ipiiw) 



(6) 



for every with s{w,z) > i. We shall see that we may achieve this by a 
suitable choice of {si). We note that when s{w, z) > i, 





< 


log -= 






-01-1 (2) 



so 



log^ 



V'i-i(w) 
log-- 



Applying this inductively, we obtain the estimate 



logi^^) 



Uz) 



+ (1 + e'i)CpP'("'''^-' + (1 + £^(1 + e^_i)Cj=,r("''")-(^-i) 



+ ... + Y[il + e'j)CpP'^'"''^-\ 

We see this is bounded above by the constant (C# + P^^Cp)Ko —: C. So we 
have 

"^'^^^ <C + Cp for s{w,z)>t. 



log 



'(pi{z) 



It remains to examine the choice of sequence (si) necessary for © to hold. For 
now, let (si) be some sequence with Si < e[ for each i. Let F = ii{w) — ii{z) 
and write SiT '■= Si w = £i.z- Then 



log^ 



•^iiz) 



log 



■ipi{w) 



^iiz) 



< 



< 




(Mz) 

Six 

1 



£i,r 



ipi (w) 



for some Ci > 0. 
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We see that < 



< Si for all w E r, and 



gj r _ Si r 

- 2' 

so Ci may be chosen so as not to depend on anything. Continuing from the 
estimate above, 







1 ( \ 


< 











1 



< C1C2 



£i 



log 



where C2 may be chosen independently of i,w, z since 



tpi (w) 



pro- 



vided that at each stage we choose Si small enough that CiC2jz^ < £[■ 

We confirm that it is sufficient to take £i = 6e'^ for small enough 6 > Q. This 
means 

e,. „ „ Se', C1C2S , 



C1C2 

so taking S = j^^^ 



is sufficient 



— C1C2 



1 - Se' 



< 



1 



□ 



7 Choosing a sequence (sj) 

Having shown that it is sufficient for our purposes for the sequence (e^) to 
satisfy conditions Q and (jSJ , we now consider how we might choose a sequence 
(ci) which, subject to these conditions, decreases as slowly as possible. Having 
chosen a sequence, we shall then estimate the rate of convergence this gives us. 

Lemma 1 Given a sequence there exists a sequence (e^ < ^ satisfying 

^ and |3|) such that for Ei = 6e'^, and any K > 1, 

n(l-|) <Cmax(z;,($)*,0^) 

for some 9 < I depending only on F , and some C > 0. 

Proof: We start by defining a sequence {v*) > as follows: we let Vq — vq{^) 
and V* = max(?;i(<i>), cv*_^), where c is some constant such that exp{— min(i, l3~^ — 
1)} < c < 1. We claim that v* = 0{vi{^)) unless Vi{^) decays exponentially 
fast, in which case v* decays at some (possibly slower) exponential rate. To 
see this, suppose otherwise, in the case where decays slower than any 

exponential speed. Then for large i certainly v* > Vi{^), and so v* = cv*_i 
for large i, and (v*) decays exponentially fast. But this means decays 
exponentially fast, which is a contradiction. 
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Let us now choose £[ — log^^pr^. (We ignore the trivial case t;o(<&) = 0.) 
We see that aU terms are smah enough that Q is satisfied, and in particular, 



£i ^ 5- Furthermore, 



For any K > 1, 



= <exp{logwo - log<} 



= '^'^P I ~ ;^ (log ^'o - log ^* ) 
= («o(<i>)-^<)^. 

If (u*) decays exponentially fast, we get an exponential bound. Otherwise, 
v*^ =0{v^{^)i). 

8 Convergence of measures 

We introduce a sequence of measure densities $o = 5'2>-- - corre- 

sponding to the sequence {V'l} in the following way: 

Lemma 2 Given a sequence (e^) — {5e'^ satisfying the assumptions of Propo- 
sition^ there exists K > 1 dependent only on F , C$ and Vf){'^) such that for 
all z e A X A, i > 1, 



Proof: If we fix i > 1, F G and w, z G F, then by Proposition ^ we have 



where Co = e*-^ > 1. From Sublemma|2| we have 

1 - 1 



JF{Fi-^w) JF{F^-^z)' 
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so 



Jpi-'^iw) JF{F'-^w) ~ ° JFi{z) ' 
Now we obtain a relationship between and by writing 

= inf -i^M-]jF\z) 

\JF{Fi-^z) weU^.) JF{F^-^w) J 

= i^-e. inf M^V^^(^)- 
So for any z e A x A we have that 

^ ^ - I JFi{z) K JF-{z) ) ^ ' 



0-1) 



where if = (7oe'^F. □ 

The above lemma gives an estimate on the total mass of $i for each i. To 
obtain an estimate for the difference between F"A and -F"A', we must use this, 
and also take into account the length of the simultaneous return time T. 

Lemma 3 For all n > 0, 

n I i \ 

\F:\ - F^A'l < 2P{T >n} + 2Y,\^{}-j^)\P{Ti<n< T,+i}, 

i=i \j=i / 

where K is as in the previous lemma. 

Proof: We define a sequence {(f>i} of measure densities, corresponding to 
the measure unmatched at time i with respect to F x F. We shall often write 
$i(m X m), say, to refer to the measure which has density $j with respect to 
m X m. For ^ e A x A we let ^n{z) = ^i{z), where i is the largest integer such 
that Ti{z) < n. Writing $ = $„ + Efc=i(*fc-i - ^fe), we have 

|F,"A-F,"A'| = \n4FxF)^mmxm))-7r:{FxF):mmxm))\ 

< IMF X F):($„(m X m)) - x F):{^„{m x m))| 

n 

+ J2\in.- <) [{F X - $fc)(m x m))]| . (7) 

The first term is clearly < 2/ $„rf(m x m). Our construction should ensure 
the remaining terms are zero, since we have arranged that the measure we 
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subtract is symmetric in the two coordinates. To confirm this, we partition 
A X A into regions on which each Tm is constant, at least while T„i < n. 

Consider the family of sets Ak^i, i,k G N, where Ak^i := {z e A x A : Ti(z) = 
k}. Clearly, each Ak^i is a union of elements of ^i, and for any fixed k the sets Ak^i 
are pairwise disjoint. It is also clear that on any Ak^i, $fe-i = <&i-i — 

and for any k, $fc_i = $fe on A x A — UiAk^i- So for each fc, 

7r,(i^xF):^(($fe_i-$fe)(mxm)) 
' re(5.|A,,.) 

We show that this measure is unchanged if we replace tt with tt' in the last 
expression. Let i? C A be an arbitrary measurable set, and fix some F £ S,i\Ak^i. 
Then 

7r,F:(($,_i - ^,){{m X m)\T)){E) 
= F: ((s.JF^ inf ) ((^ X ™)|r) I {E X A) 



^e.,CJF\m X m)j [F^^E x A) n T) 
where C is constant on T. This equals 

£iCJF^d{m X m) — £iC{m x m){E x A). 

F-'(£;xA)nr 

Since (m x m)(i? x A) = (m x m)(A x E), the terms of the sum in 10 all have 
zero value, as claimed. 
Now 

/oo „ 
$„d(m X m) = / $„d(m x m); 

in fact, since Ti > i, all terms of the series are zero for i > n. For 1 < i < n. 



= / f n (i 

{T,<n<T, + i} J{T,<n<Ti+i} J {Ti<n<T, + i} ^^^^ ^ 



The estimate claimed for |F"A — F"A'| follows easily. □ 
Finally we state a simple relationship between P{T > n} and (m x m){r > 

n}. From now on we shall use the convention that P{condition|r} := -p^Plx G 

F : X satisfies condition}. 

Sublemma 2 There exists K > depending only on C$ and vq(^) s.t. Vi > 1, 

VF e 6, 

P{T,+i - > n|F} < K{mo x mo){T > n}. 

The dependence of K on P may be removed entirely if we take only i > some 
MP)- 
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Proof: Let ^ = p^FUPlT). We see that P{T,+i - T, > n|r} = ^{T > n}. 
We prove a distortion estimate for j(y„|^xmo) ' ^^i'^S estimates of Sublemma 
n] Let w, z G Af) X Aq and let Wo, zq G T he such that F^ii^o = w, F'^zq = z. 
Then 



log 



d{mxm,) 



{z) 



< 



lo; 



$(wo) 



+ 



log 



This gives 



(i(TO X m) 
and hence the result follows. 



< 



< 



{mo X mo)(Ao x Ao) 



9 Combinatorial estimates 

In Lemmainiwe have given the main estimate involving P, T, and the sequence 
(ei). It remains to relate P and T to the sequence molR > n}. Primarily, 
this involves estimates relating the sequences P{T > n} and toqI-R > n}. 
We shall state only some key estimates of the proof, referring the reader to 
|Yo| for full details. Our statements differ slightly, as the estimates of Yo] 
are stated in terms of m{R > n}; they are easily reconciled by noting that 
m{R > n} = J2i>n''^o{R > i}- (As earlier, i? > is the first arrival time to 
Ao.) 

Proposition 2 1. If mo{R > n} = 0(6") for some < < 1, then P{T > 
n} = 0{9i), for some < 61 < 1. Also, for sufficiently small 61 > 0, 

P{T, <n< T,+i} < Ce'" for i < S^n, 

for some < 9' < 1, C > independent of i. The constants 9i, 9', 5i 
may all be chosen independently of P. 

2. Ifmo{R > n} = 0{n~°') for some a>l, then P{T > n) ^ 0{n^-'^). 

This proposition follows from estimates involving the combinatorics of the 
intermediate stopping times {r^}. Let us make explicit a key sublemma used in 
the proofs, concerning the regularity of the pushed-forward measure densities 
; for the rest of the argument we refer to |Yo| . as the changes are minor. 

Sublemma 3 For any k > 0, let Q e Vf=o^ ^"''^ ^-^^ ^''^ = ^o- Let 
H = F^(A|n). Then Va;, y £ Aq, we have 



dm 



<Co 



for some C'o{X), where the dependence on X is only on i'o(A) and C\, and may 
be removed entirely if we only consider k > some fco(A). 
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Proof: Let (p — Rx x,y € Aq, and let xo,yo be the unique points in fl 
such that Pf^xo = X, P^yo = y. We note that = ^(a;o) • '^^"('"1") [xq). So 



ip{xQ) JP'^yo 



< 



< 



JP'^xq (f{yo) 

JP'^yo 

vivo) 



ip{xo) 
JF'^yo 



1 



(p{xo) ifiiyo) 



JP^xo JPi^yo 
1 



l^pixo) - ip{yo)\ 



JPl^XQ 



< (1 + C^Vj{ip))C' + < (1 + C^voiip))C' + C^voi^), 

where j is the number of visits to Aq up to time k. Clearly the penultimate 
bound can be made independent of A for j > some jo (A). 

The following result combines the estimates above with those of the previous 
section. 

Proposition 3 1. When malR > n} < CiO" , 



\p:x-p:x'\<c9'^' + 2 11(1 

i=[5in] + l \j=l 

for some < 0' < 1 and sufficiently small Si . 
2. When mQ{R > n} < dn^", a > I, 



\p:x-p:x'\<2cw 



[Sin] I 

cn-5:."n(i-|) 

i=i ]=i 



i=[5in] + li=l 

for sufficiently small 5i . 
Proof: In the first case, Proposition [5] and LemmaOtell us that 

[<5in] n i 

\p:\ - p:\'\ <c0^ + 2Yp {T^ < ?i< T,+i} + 2 Y n (i - ^ 

1=1 i=[Sin] + lj=l 



for any < Si < 1, for some < 6*0 < 1. For sufficiently small 6i, the middle 
term is < C[Sin]9'", which decays at some exponential speed in n. 
In the second case, for any < (5i < 1 we have 



\p:x - p:x'\ < cn'"^ + 2 ^ n (i - 1) ^ " < ^^+1} 

i=i 



i=lSin] + lj=l 



K 
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We estimate the middle term by noting that 



< K{i + l){mxm)\T> 



i + 1 

< Cn^-°'{i + 1)" for some C > 0. 

For the last step, note that Proposition|2] applies to the normalisation of (to x to) 
to a probability measure. 

10 Specific regularity classes 

We now combine all of our intermediate estimates to obtain a rate of decay of 
correlations in the specific cases mentioned in Theorem First, we set C — "R'i 
which can be seen to depend only on F, C$ and vo{'^). Throughout this section, 
we shall let C denote a generic constant, allowed to depend only on F and 
which may vary between expressions. 

10.1 Exponential return times 

In this subsection, we suppose that too{-R > n} = 0(0"), and hence m{R > 

n} = o{e"). 

Class (VI): Suppose «;($) = C>{6\) for some 9i < 1. By Lcmma^we may 
take (si) such that conditions and Q are satisfied, and 

for some < 02 < 1- Applying Proposition Owe have 

\Fl'\ - Fl'X'l < (76*'" + C J2 ^2 

i> [Sin] 

for some 9' < 1 and sufficiently small 6i > 0. This give the required exponential 
bound in n. 

Class (V2): Suppose Wi($) = 0{e-^''), for some 7 G (0,1). Then there 
exists (sj) such that 

n(i_l).o(e-n 

So 

i>[i5in] 
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We see that e ""'^ — 0{e ) for every < 7' < 7, and it is well known that 

the sum is of order e~" for every < 7" < 7'. 

Class (V3): Suppose = O (e-(i°s')'') for some 7 > 1. We may take 

[sj) such that 

So 

i>[5in] 

It is easy to show that e-'^C"^^)" ^ q (^e-(i°8 ^ for every < 7' < 7. So the 

sum is of order C)(e~^'°s")"' ) for every < 7" < 7. 

Class (V4): Suppose Vi{^) — 0{i'^'^) for some 7 > i. Then we can take 
(Ej ) such that 

n(i-l)<crc.. 

So 

i=[<5iJi+l] 

10.2 Polynomial return times 

Here we suppose mo{R > n} = 0{n~°') for some a > 1. Suppose = 
0{n~''), for some 7 > |- We can take (e^) such that 



By Proposition 13 for some Si, 



[Sin] 00 

C7 



i=l i=[5iri] + l 

The third term here is of order n^~''^ . To estimate the second term, we 
consider three cases. 

Case 1: 7 > ^^ii. Here, a — C7 < ~li so the sum is bounded above 
independently of n, and the whole term is 0{n^~°'). 

Case 2: -f=^. The sum is 

/[Sm] 
x-^dx = 1 + log[(5in] = ©(logn). 

'i^ [Oi nj 



26 



So the whole term is 0{n^ " logn). 

Case 3: | < 7 < The sum is of order 71"+^"'=''', and so the whole term 

is 0{n^-^^). 



11 Decay of correlations 

Finally, wc show how estimates for decay of correlations may be derived directly 
from those for the rates of convergence of measures. 

Let (^,'0 € £°°(A,m), as in the statement of Theorem We write V' 
b{ip + a), where a — 1 ~ inf ip and b is such that J ipdi/ = 1. We notice that 



be 



l+voiip) ' 

Now let p - 



and that inf -ip — b, snpip < 1 + vq^t/j). 



dm 



and let A be the measure on A with 



dX 
dm 



(ip o F") ipdv - / ipdv / ipdv 



< - 



(<^oF")V^diy- 



ipdi^F:i'4,pm) 
dFl'X 



tpp. We have 
(pdu J '(jjdv 
Lppdm 



< 



dm 



dm 



It remains to check the regularity of ipp. First, 



< 
< 



{p{x) - p{y)) + p{y) {^p{x) - -0(2/)) 
l^lloo \p{x) - p{y)\ + IIpIIooI&I - 0(2/)| 



It can be shown that p is bounded below by some positive constant, and 
Vn{p) < C/3". (This is part of the statement of Theorem 1 in |Yo|.) So ^p is 
bounded away from zero, and Vnii/jp) < C/3" + Cvn{i^), where C depends on 
Vo{iIj). 

Taking A' — we see that < C/?" + Cvn{ip)- This shows that esti- 

mates for |F"A — i^"A'| carry straight over to estimates for decay of correlations. 
To check that the dependency of the constants is as we require, we note that 
we can take 



1 



1 



inf 

dm 



< 



1 



inf V'/O b inf p 



< 



inf p 



So an upper bound for this constant is determined by fo(V'). Clearly C$ depends 
only on F and an upper bound for vo{ip), and in particular these constants 
determine C — 
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12 Central Limit Theorem 



We verify the Central Limit Theorem in each case for classes of observables 
which give summable decay of autocorrelations (that is, summable decay of 
correlations under the restriction ip — ip). 

A general theorem of Liverani (^) reduces in this context to the following. 

Theorem 7 Let {X,J-,fi) be a probability space, andT : X O a (non-invertible) 
ergodic measure-preserving transformation. Let Lp G C°° {X, ji) be such that 
J ipdfj, = 0. Assume 



E 

n=l 



{if O T'')ipdfl 



< OO, (8) 



^^(T*"(^)(a;) is absolutely convergent for fi-a.e. x, (9) 

ra=l 

where T* is the dual of the operator T : ip ip o T. Then the Central Limit 
Theorem holds for ip if and only if ip is not a coboundary. 

In the above, the dual operator T* is the Perron-Frobenius operator corre- 
sponding to T and /i, that is 



(T ip){x) = 2^ 



y:Ty=x 

Of course the Jacobian JT here is defined in terms of the measure fi. 

Let </? : A — > R be an observable which is not a coboundary, and for which 

C„(<^, ip] v) = j {ip o F")ipdu - ( J ipdu) 

is summable. Let (j) — ip — J ipdv, so that J (pdv — J ipdv — /(/ ipdv)dv — 0. 

We shall show that </) satisfies the assumptions of the theorem above. It is 

straightforward to check that C„(iy9,(/?;i^) = C„((/), 0; z^) = \ ^ {(j) o F"-)4>dv\. Hence 

condition ||HJ) above is satisfied for 0. 

Since m and u are equivalent measures, it suffices to verify the condition in 
m-a.e. The operator F* is defined in terms of the invariant measure, so for 

a measure A <C m it sends ^ to ^f^^. By a change of coordinates (or rather, 

of reference measure) , we find that 

where V is the Perron-Frobenius operator with respect to m, that is, the oper- 
ator sending densities 4^ to 

^ am am 
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We shall now write (p ^ the difference of the densities of two (positive) 
measures of similar regularity to cf). We let cf) = b{(j) + a), for some large a, with 
6 > chosen such that / (j>pdm = 1. We define measures A, A' by 

— = [h(j) + (j)}p, — = (pp. 
am dm 

It is straightforward to check this gives two probability measures, and that 

\ dm dm / 

As we showed in the previous section, Vn{4>p) < C/3" + Cvn{<j>) for some C > 0. 
Also, b<p + (p — b{2(p + a), which is bounded below by some positive constant, 
provided we choose sufficiently large a. We easily see i;„(A), ^^(A') < Cvn{<p)- 

We now follow the construction of the previous sections for these given mea- 
sures A, A', and consider the sequence of densities $„ defined in SjHl We have 

- F^X' = TT,{Fx Fy:{<S>n{m x m)) - <(F x F):'($„(to x m)). 

Let ipn be the density of the first term with respect to m, and the density 
of the second. Since 7^ is a linear operator, we see that 



dFl'X dF^X' 



dm dm 

These densities have integral and distortion which are estimable by the construc- 
tion. We know / tpndm = / ^j'^dm = f $„d(m x m). In the cases we consider 
(sufficiently fast polynomial variations) this is summable in n; notice that we 
have already used this expression as a key upper bound for ^\F^X — i^"A'| 
(see Lemma . It remains to show that a similar condition holds pointwise, 
by showing that tpn, 4'n both have bounded distortion on each A;, and hence 
l-FJ'A — F"A'| is an upper bound for -0„ + 'fp'm up to some constant. This fol- 
lows non-trivially from Proposition^ which gives a distortion bound on {'I'fc}, 
and hence on {$„} when we restrict to elements of a suitable partition. The 
remainder of the argument is essentially no different from that given in jYo| . 
and we omit it here. 



13 Applications 

Having obtained estimates in the abstract framework of Young's tower, we now 
discuss how these results may be applied to other settings. First, we define 
formally what it means for a system to admit a tower. 

Let X be a finite dimensional compact Riemannian manifold, with Leb de- 
noting some Riemannian volume (Lebesgue measure) on X. We say that a 
locally non-uniformly expanding system f : X O admits a tower if there 
exists a subset Xq C X, Lcb(Xo) > 0, a partition (mod Leb) V of Xq, and a 
return time function R : Xq N constant on each element of V, such that 
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• for every lu £ V, f^\oJ is an injection onto Xq; 

• and (/^|a;)~^ are Leb-measurable functions, Voj € V; 

• Vjlo(/^) "'^ trivial partition into points; 

• the volume derivative det Df^ is well-defined and non-singular (i.e. < 
I detD/^l < oo) Leb-a.e., and 3C > 0, ^9 < 1, such that VtJ G V, "ix,y £ 
bj, 

det Df^ {x) 
dctDfR{y) 

where s is defined in terms of /^^ and V as before. 

We say the system admits the tower F : (A, to) O if the base Aq = Xq, 
m|Ao — Leb|Xo, and the tower is determined by Aq, i?, :— and V as 
in 33 It is easy to check that the usual assumptions of the tower hold, except 
possibly for aperiodicity and finiteness. In particular, jdetD/^l equals the 
Jacobian JF^ . 

If F : (A, to) O is a tower for / as above, there exists a projection tt : A X 
we shall simply call the tower projection^ which is a semi-conjugacy between / 
and F; that is, for x G A;, with x = F^xq for xq € Aq, 7r(x) :— /''{xq). In 
all the examples we have mentioned in ^ the standard tower constructions (as 
given in the papers we cited there) provide us with a tower projection tt which 
is Holder-continuous with respect to the separation time s on A. That is, given 
a Riemannian metric d, in each case we have that 

3/3 < 1 such that for x,y e A, d{TT{x), 7r(y)) = 0(/3'(^'^)). (10) 

Note that the issue of the regularity of tt is often not mentioned explicitly in the 
literature, but essentially follows from having good distortion control for every 
iterate of the map. (Formally, a tower is only required to have good distortion 
for the return map F^, which is not sufficient.) 

Given a system / which admits a tower F : (A, to) O with projection tt 
satisfying HlOfl . we show how the observable classes {Rl — 4) on X correspond 
to the classes {VI — 4) of observables on A. Recall that for given ip, 

TZeiip) := sup{|V'(a;) - -0(^)1 : d{x,y) < e}. 

Given a regularity for ip in terms of TZdi^), we estimate the regularity of 
ip o TT, which is an observable on A. 

Lemma 4 • If ip E {Rl,j) for some 7 e (0, 1], then ip o tt E {VI); 

' if ijj E (i?2, 7) for some 7 E (0, 1), then ip o tt E {V2, 7') for every 7' < 7; 

• if tp E (i?3, 7) for some 7 > 1, then ijj o tt E {V3, 7') for every 7' < 7; 
' if Ip E (i?4, 7) for some 7 > 1, then -tjj o tt E {Vi, 7). 



< C(3- 



•s{F'^x,F'^y) 
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Proof: The computations are entirely straightforward, so wc shall just make 
explicit the (i?4) case for the purposes of illustration. 

Suppose TZs{ip) — Odlogep''), for some 7 > 0. Then, taking n large as 
necessary, 

= C(nlog/3~i -logC)-^ 
< C(|log/3-i)-^ 

□ 

Let us point out that the condition (|10|) is not necessary for us to apply these 
methods. If we are given some weaker regularity on tt, the classes (VI — 4) shall 
simply correspond to some larger observable classes on the manifold. It remains 
to check that the semi-conjugacy tt preserves the statistical properties we are 
interested in. 

Lemma 5 Let v he the mixing acip on A given by Theorem Given ip : 
X ^ M., let (f ^ ip o TT , -tjj ^ -tp o TT . Then 

Cn{(p,ip;TT*v) ^Cn{ip,ijj;iy). 

Lemma 6 Suppose the Central Limit Theorem holds for (F, v) for some ob- 
servable tp : A M. Then the Central Limit Theorem also holds for (/, tt^j/) 
for the observable ip — (p o n. 

The truth of these lemmas follows easily from the relevant definitions. Fi- 
nally, we should verify that the measure tt*:/ is a suitable measure to work with. 

Lemma 7 The measure tt^i/ is an absolutely continuous invariant probability 
measure for f , with respect to Lebesgue measure. 

Proof: Firstly, tt* v is easily seen to be a mixing invariant probability measure 
for /. The measure ly is equivalent to m, so tt^i^ is equivalent to 7r*m. We show 
TT^TO <C Leb. Let A C X, and assume n^.m^A) > 0. Let 77 be the usual partition 
of A; there exists E E rj with m{Ti^^A n i?) > 0. So there exists Eq E V and 
i < R{Eo) such that m{F~^TT~^A n Eq) > 0. This means there exists Aq C E^ 
such that m(^o) > and i^Mo C 7r"^(A). 

We know from the construction that Leb(v4o) = m{AQ) > 0. Also, /'|v4o is 
injective, and since the Jacobian of is non-singular, so is the Jacobian of /, 
and Leb(/*^o) > 0. We have /*(^o) = ttF^Aq) C A, so Leb(^) > 0. This 
proves that 7r*m <C Leb. □ 

14 An example with non-Holder projection 

Finally, we briefly give an example of a system which exhibits a tower with 
non-Holder tower projection. 
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Figure 1: A system admitting a tower via a non-Holder scmi-conjugacy. 



Given < a < 6 < 1, and a > 1, we define / : [0, 1] by 

r 1 - (1 - &)(-loga)"(-log(a- x))-" xe[0,a] 

[ 6-^exp{(l-6)""\loga)(l-x)-""'} xe[b,l] 

(See figure n) Note that the map has unbounded derivative near a, and that 
a maps onto the critical point at 1. It is easy to check that / is monotone 
increasing on each interval, and that / has a Markov structure on the intervals 
[0,a], {a,b), [b,l], and could be represented by a subshift of finite type. A 
straightforward, if tedious, calculation shows that for x G [0,a], y G [b, 1], 

fix) = a(l-6)(-loga)"(a-x)-i(-log(a-x))-("+i); 

fiv) = --a-i(l-6r"\loga)(l-yr 
a 

exp{(l-6)""(loga)(l-2;)-""}; 

f'ifix)) = -a-\l-br\^\oga)-'^{a-x)-\-\og{a-x)r+'; 
a 

a 

Taking Aq = [0,5], V = {[0, a], (a, 6)} with R{[0,a]) ^ 2, R{{a,b)) = 1, 
it is clear that the conditions for / to admit a tower F : A O are satisfied. 
For x,y £ [0, a], we have that |x — y| w jf -^^g k and consider 

— /(t/)| for x,y € [0, o) with s{x, y) = k, then for y close to a, we have 

« (fclog(6/a) + C)-" 

for some C. This determines the regularity of the tower projection tt, which is 
in particular not Holder continuous. However, if we take tp G (i?l,7) for some 
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7 S (0, 1], then for a;, y e A, 

\ip o 7r(a;) — tjj o 7r(j/)| < Cs{x, y)~°''^, 

so V ° € {Vi, 07). This means that for observables in 7) for sufficiently 
large 7 (dependent on TZooii^)), we get polynomial decay of correlations. Also 

note that if we take with TZeitp) = O{0^ ), for some d € (0, 1), then 

IV o w{x) -ipo 7r(y)| < C9^'^<'''y^\ 

and o 77 G (VI). So we get exponential mixing for some non-trivial subset of 
(i?l). We may find observable classes corresponding to (V2), (FS) in similar 
fashion. 
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